Tree Dependent Identically Distributed Learning
Abstract
We view a dataset of points or samples as having an underlying, yet unspecified, tree structure and exploit this assumption in learning problems. Such a tree structure assumption is equivalent to treating a dataset as being tree dependent identically distributed or tdid and preserves exchangeability. This extends traditional iid assumptions on data since each datum can be sampled sequentially after being conditioned on a parent. Instead of hypothesizing a single best tree structure, we infer a richer Bayesian posterior distribution over tree structures from a given dataset. We compute this posterior over (directed or undirected) trees via the Laplacian of conditional distributions between pairs of input data points. This posterior distribution is efficiently normalized by the Laplacian’s determinant and also facilitates novel maximum likelihood estimators, efficient expectations and other useful inference computations. In a classification setting, tdid assumptions yield a criterion that maximizes the determinant of a matrix of conditional distributions between pairs of input and output points. This leads to a novel classification algorithm we call the Maximum Determinant Machine. Unsupervised and supervised experiments are shown.
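The normalization step mentioned in the abstract can be illustrated with Kirchhoff's matrix-tree theorem: the sum over all undirected spanning trees of the product of edge weights equals the determinant of the graph Laplacian with one row and column removed. The sketch below is a minimal illustration only, not the paper's implementation. The function names and the symmetric weight matrix `W` are assumptions introduced here; in the paper the pairwise weights come from conditional distributions between data points, which are not reproduced.

```python
import itertools
import numpy as np


def tree_partition_function(W):
    """Sum over undirected spanning trees of the product of edge weights,
    computed with the matrix-tree theorem (hypothetical helper)."""
    W = np.asarray(W, dtype=float)
    # Laplacian: weighted degrees on the diagonal, negated weights elsewhere.
    # Any diagonal entries of W cancel out in this expression.
    L = np.diag(W.sum(axis=1)) - W
    # Delete one row and the matching column, then take the determinant.
    return np.linalg.det(L[1:, 1:])


def brute_force_partition_function(W):
    """Same quantity by explicit enumeration; only feasible for tiny n."""
    W = np.asarray(W, dtype=float)
    n = W.shape[0]
    edges = list(itertools.combinations(range(n), 2))
    total = 0.0
    for subset in itertools.combinations(edges, n - 1):
        parent = list(range(n))  # union-find forest for cycle detection

        def find(x):
            while parent[x] != x:
                parent[x] = parent[parent[x]]
                x = parent[x]
            return x

        is_tree = True
        for i, j in subset:
            ri, rj = find(i), find(j)
            if ri == rj:  # edge would close a cycle
                is_tree = False
                break
            parent[ri] = rj
        if is_tree:  # n - 1 acyclic edges form a spanning tree
            total += float(np.prod([W[i, j] for i, j in subset]))
    return total


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A = rng.random((5, 5))
    W = (A + A.T) / 2  # assumed symmetric pairwise weights
    print(tree_partition_function(W), brute_force_partition_function(W))
```

Under these assumptions, the posterior probability of a particular tree would be the product of its edge weights divided by `tree_partition_function(W)`. The brute-force version exists only to check the determinant on small inputs.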
Similar resources
A Note on the Strong Law of Large Numbers
Petrov (1996) proved the connection between general moment conditions and the applicability of the strong law of large numbers to a sequence of pairwise independent and identically distributed random variables. This note examines this connection to a sequence of pairwise negative quadrant dependent (NQD) and identically distributed random variables. As a consequence of the main theorem ...
Higher moments portfolio Optimization with unequal weights based on Generalized Capital Asset pricing model with independent and identically asymmetric Power Distribution
The main criterion in investment decisions is to maximize the investor's utility. Traditional capital asset pricing models cannot be used when asset returns do not follow a normal distribution. For this reason, we use capital asset pricing model with independent and identically asymmetric power distributed (CAPM-IIAPD) and capital asset pricing model with asymmetric independent and identically a...
Simulation of (M1, M2)-dependent random fields with K-distributed marginals
A method to simulate a two-dimensional (m1,m2)-dependent random field Y with K-distributed marginals is presented. The simulation starts with a random field with independent and identically standardized normally distributed elements. Then a (m1,m2)-dependent matrix is calculated using weighted sums. It has identically standardized normally distributed marginals. From this matrix the desired rand...
Time Series Models
Overview: In contrast to the classical linear regression model, in which the components of the dependent variable vector y are not identically distributed (because its mean vector varies with the regressors) but may be independently distributed, time series models have dependent variables which may be identically distributed, but are typically not independent across observations. Such models ar...
Introduction to Time Series Models
Overview: In contrast to the classical linear regression model, in which the components of the dependent variable vector y are not identically distributed (because its mean vector varies with the regressors) but may be independently distributed, time series models have dependent variables which may be identically distributed, but are typically not independent across observations. Such models ar...
Publication date: 2005